Yet Another Matcher

ثبت نشده
چکیده

Discovering correspondences between schema elements is a crucial task for data integration. Most matching tools are semi-automatic, e.g. an expert must tune some parameters (thresholds, weights, etc.). They mainly use several methods to combine and aggregate similarity measures. However, their quality results often decrease when one requires to integrate a new similarity measure or when matching particular domain schemas. This paper describes YAM (Yet Another Matcher), which is a matcher factory. Indeed, it enables the generation of a dedicated matcher for a given schema matching scenario, according to user inputs. Our approach is based on machine learning since schema matchers can be seen as clas-sifiers. Several bunches of experiments run against matchers generated by YAM and traditional matching tools show how our approach (i) is able to generate the best matcher for a given scenario and (ii) easily integrates user preferences, namely recall and precision tradeoff.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Yet Another Matcher

Discovering correspondences between schema elements is a crucial task for data integration. Most matching tools are semi-automatic, e.g. an expert must tune some parameters (thresholds, weights, etc.). They mainly use several methods to combine and aggregate similarity measures. However, their quality results often decrease when one requires to integrate a new similarity measure or when matchin...

متن کامل

Encore un outil de découverte de correspondances entre schémas XML ?

In this paper, we present YAM, a schema matcher factory. YAM (Yet Another Matcher) is not (yet) another schema matching system as it enables the generation of a la carte schema matchers according to user requirements. These requirements include a preference for recall or precision and a training data set (a set of expert correspondences or a domain of interest). YAM uses a knowledge base that i...

متن کامل

YAM: A Step Forward for Generating a Dedicated Schema Matcher

Discovering correspondences between schema elements is a crucial task for data integration. Most schema matching tools are semiautomatic, e.g., an expert must tune certain parameters (thresholds, weights, etc.). They mainly use aggregation methods to combine similarity measures. The tuning of a matcher, especially for its aggregation function, has a strong impact on the matching quality of the ...

متن کامل

Automatic integration of Heterogenous XML-schemas

Due to the XML’s flexibility and semi-structured nature, complications arise when trying to transplant data from one XML to another. Researchers have made great strides in solving the problem of integrating homogenous XML. But there are very few specifically addressing the problem of integrating heterogenous documents. We introduce XSD Matcher, a system for automatically mapping a collection of...

متن کامل

LYAM++ results for OAEI 2015

The paper presents a novel technique for aligning cross-lingual ontologies that does not rely on machine translation, but uses the large multilingual semantic network BabelNet as a source of background knowledge. In addition, our approach applies a novel orchestration of the components of the matching workflow. We demonstrate that our method outperforms considerably the best techniques in the s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009